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Abstract 



Quantum discord was proposed as an information theoretic measure of the "quantumness" 
of correlations. I show that discord determines the difference between the efficiency of 
quantum and classical Maxwell's demons in extracting work from collections of correlated 
quantum systems. 

Information has an energetic value: It can be converted into work. Maxwell's demon 1 
was introduced into thermodynamics to explore the role of information and, more generally, 
to investigate the place of "intelligent observers" in physics. In modern discussions of the 
subject 2 "intelligence" is often regarded as predicated upon or even synonymous with the 
information processing ability - with computing. Thus, Maxwell's demon is frequently 
modelled by a universal Turing machine - a classical computer - endowed with the ability 
to measure and act depending on the outcome. The role of such a demon is then to 
implement an appropriate conditional dynamics - to react to the state of the system as 
revealed through its correlation with the state of the apparatus. It is now known that 
quantum logic - i.e., logic employed by quantum computers - is in some applications more 
powerful than its classical counterpart. It is therefore intriguing to enquire whether a 
quantum demon - an entity that can measure non-local states and implement quantum 
conditional operations - could be more powerful than a classical one. I show that quantum 
demons can typically extract more work than classical demons from correlations between 
quantum systems, and that the difference is given by a variant of the quantum discord, 
recently introduced 3-5 measure of the "quantumness" of correlations. 

Quantum discord 3-5 is the difference between the two classically identical formulae 
that measure the information content of a pair of quantum systems. Several closely related 
variants can be obtained starting from the original definition 3 given in terms of the mutual 
information 6 . Mutual information is a measure of the strength of correlations between, 
say, the apparatus A and the system S: 



I(S : A) = H(S) + H(A) - H(S, A) 
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It measures the difference between the missing information about two objects when they 
are taken separately, 77(5) + H(A), and jointly, H(S,A). In the extreme case 5 and A 
are identical - e.g., two copies of the same book, or a state of the apparatus pointer A 
after a perfect but as yet unread measurement of 5. Then the joint entropy 77 (5, A) is 
equal to H(A) = 77(5), so 7(5 : A) = H(A). By contrast, when the two objects are not 
correlated, H(S,A) = 77(5) + H(A), and 7(5 : A) = 0. 

The other formula for mutual information employs classical identity for joint entropy 6 : 

77(5, A) = H(A) + H(S\A) = 77(5) + H(A\S) (2) 

Above H(S\A) is the conditional entropy - measure of the lack of knowledge about the 
system given the state of the apparatus. Substituting this in Eq. (1) leads to an asymmetric 
looking formula for mutual information: 

7(5 : A) = 77(5) + H(A) - [H(A) + H(S\A)} (3) 

We have refrained from carrying out the obvious cancellation above that would have yielded 
7(5 : A) = 77(5) — H(S\A) for a reason that will become apparent very soon. 
Discord is defined as: 

S(S\A) = 7(5 : A) - 7(5 : A) = [H(A) + H(S\A)} - 77(5, A) (4) 

Classically, of course, discord disappears as a consequence of Eq. (2). In quantum theory 
the situation is no longer this simple: In order to properly define the conditional entropy 
one must specify how the apparatus is "interrogated" about 5: Measurements modify the 
state of the pair. After a measurement of the observable with eigenstates {|Afc)} observer's 
own description of the pair is the conditional density matrix: 

PSA\A k ) = PS\A k ) ® \A k )(A k \ (5) 

He will attribute to the system Ps\A k ) with the probability PA(k) = Tr(Ak\psA\Ak)- The 
post measurement density matrix p' SA differs form the pre-measurement psa even for a 
bystander who has not yet found out the outcome. This point of view of the bystander 
differs from the viewpoint of the observer (or a demon) who made the measurement: Demon 
knows that the apparatus is in the state \A k ). Bystander obtains his post-measurement 
Psa by averaging over the outcomes. 

PSA = ^PA(k)p S \ Ak ) ® \A k )(A k \ (6) 
k 
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His description of the pair is unaffected by the measurement only when the measured 
observable commutes with p$ A . We shall find this bystander viewpoint very useful because 
it represents a statistical ensemble of all possible outcomes. 

In quantum physics one possible definition of joint entropy is inspired by Eq. (3): 

H A (S,A {lAk}} ) = [H(A)+H(S\A)] {lAk)} (7) 

where is the eigenbasis of the to-be-measured observable of the apparatus. An- 

other acceptable and completely quantum definition would be to simply compute the von 
Neumann entropy of the density matrix p$ A describing the joint state. Then: 

H(S,A) = -Tr PsA \gp s ,A = -Y,Psa{1)^Psa{1) (8) 

i 

where Psa(1) are the eigenvalues of psa ~ the probabilities of the density matrix that jointly 
describes the correlated pair. These eigenvalues always exist, but in general correspond to 
entangled quantum states in the joint Hilbert space of S and A. Such states cannot be 
found out through sequences of local measurements starting with just one subsystem of 
the pair - say, A. This is a fundamental difference between the quantum and the classical 
realm (where such "piecewise" investigation is always possible and need not disturb the 
state of the pair). It is responsible for non-zero discord. 

A simple example of this situation is a perfectly entangled state: 

|V'5^) = (|00) + |ll))/v / 2 (9a) 

Above, the first entry refers to S while the second corresponds to A. Clearly, psa = 
\iI>sa)(4'sa\ is pure - the pair is with certainty in the state \iPsa)- Hence, H(S, A) = 0. 
On the other hand, Pa(S) = l.A(S)/2, where 1 is the unit matrix in the appropriate Hilbert 
space, so that H(A) = H(S) = 1. Consequently, I(S : A) = 2, but the asymmetric mutual 
information is J(S : A) = 1. This is because the joint information H A (S, A{\ Ak y}) defined 
with reference to any measurement on a A, Eq. (5), is a sum of H (A) = 1 and H(S\A) = 0, 
with both of these quantities independent of the basis because of the symmtery of Bell 
states. 

Readers are invited to verify that a classical correlation in: 

p 5 ^ = (|00)(00| + |ll)(ll|)/2 (96) 
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results in zero discord, but only when the preferred basis = {|0), |1)} is employed. 

The entangled state of Eq. (9a) could be converted into the mixture of Eq. (9b) through 
decoherence in the preferred (pointer) basis 4 ' 8-11 or - and this is why decoherence can be 
regarded as monitoring by the environment - through a measurement with an undisclosed 
outcome carried out in the same pointer basis {|^4fe)} = {|0), |1)}. 

In general, the ignorance of the bystander cannot decrease (but may increase) as a 
result of a measurement of a known observable if he does not know the outcome 12 . Hence, 
H A {S,A {]Ak)} ) - H{S,A) > 0, and 

S(S\A {[Ak)} ) > (10) 

Equality occurrs only when p$ A remains uneffected by a partial measurement of 
on the A end of the pair. 

The relevance of the discord for the performance of Maxwell's demon can be now 
appreciated. Demons use the acquired information to extract work from their surroundings. 
The traditional scenario starts with an interaction establishing initial correlation between 
the system and the apparatus. The demon then reads off the state of A, and uses so 
acquired information about S to extract work by letting S expand throughout the available 
phase (or Hilbert) space of volume (dimension) d$ while in contact with the thermal 
reservoir at temperature T. This yields: 

W+ = k B2 T(lgd s -H(S\A)) (11) 

of work obtained at a price: 

W~ = k B2 TH(A) (12) 

Above, ks 2 is the Boltzmann constant adapted to deal with the entropy expressed in bits 
and T is the temperature of the heat bath. The net gain is then: 

W = k B2 T{\gd s - [H(A) + H{S\A)}) (13a) 

The price W~ is the cost of restoring the apparatus to the initial ready-to- measure state. 
The significance of this "cost of erasure" for the second law was pointed out in the seminal 
paper of Szilard 13 , but its relevance in the context of information processing was elucidated 
and codified by Landauer 14 . 

It is now accepted that neither classical 15-17 nor quantum 18-21 demons can violate 
the second law because of the cost of erasure. However, a demon with a supply of empty 
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memory (used to store measurement outcomes) can extract, on the average, W + of work 
per step from a thermal reservoir. This strategy works, because, in effect, demon is using 
its memory as a reservoir with low entropy. However, a new block of empty memory 
of size d_A is used up with each new measurement. This is wasteful, and only fradulent 
accounting (uncovered by Szilard and Landauer) which ignores the thermodynamic value 
of empty memory can create an appearance of the violation of the second law. 

To optimize performance demon should use memory of A more efficiently. The obvious 
strategy here is to compress the bits of the outcomes after a sequence of measurements, 
freeing up an unused block of length Afi. The data can be compressed to the size given 
by their algorithmic complexity 17 . The savings are: 

A/jL = ]gd A -K(A k ) 

Where K{A k ) is the algorithmic randomness (Kolmogorov complexity) per step. Moreover, 
one can show that for long sequences of data the approximate equality: 

(K(A k )) ~ H(A) 

becomes exact, so that the saved up memory is on the average: 

A» = \gd A -H(A) 

Maxwell's demon can attain net work gain per step of: 

W = k B2 T(\gd s d A - [H{A) + H(S\A)}) (136) 

When S and A are classically correlated so that Eq. (2) applies, this can be written as: 

W = k B2 T(\gd s d A -H(S,A)) (13c) 

We note that the efficiency is ultimately determined by the joint entropy of S and A 
accessible to the demon, and that the same equation would have followed if we simply 
regarded the SA pair as a composite system, and the demon used it all up as a fuel. 

The efficiency of demons is then determined by what they know about the pair SA 

- its joint entropy - and we have already seen that in quantum physics joint entropy 
depends on how the information about the pair can be acquired. A classical demon is local 

- it operates one-system-at-a-time on the correlated a quantum pair SA. In this case the 
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above sketch of the "standard operating procedure" applies with one obvious caveat: It 
needs to be completed by the specification of the basis demon measures in A. The cost of 
erasure is still given by Eq. (12), also for classical demons extracting work from quantum 
systems 11-13 . Thus: 

W c /k B2 T = IgdsA ~ MA) + H(S\A)] {]Ak)} (14) 

The only difference between the classical Eq. (10) and the quantum Eq. (11) is the obvious 
dependence on the basis (|Afc)} demon selects to measure. The expression in the square 
brackets is the measure of the remaining (conditional) ignorance and of the cost of erasure. 
We shall be interested in the that maximize W c . 

A quantum demon can typically extract more work - get away with lower costs of 
erasure - because its measurement can be carried out in a basis that avoids increase of 
entropy associated with measurements and decoherence 4 ' 8-11 ' 23 : It can always select a 
global basis in the combined Hilbert space of SA that commutes with the initial psa- The 
work that can be extracted after the apparatus gets reset to its initial ready-to-measure 
pure state is: 

W Q /k B2 T = IgdsA ~ MS, A) (15) 

The other straightforward way to arrive at Eq. (15) is to use quantum demon in its 
capacity of a universal quantum computer, which, by definition, can transform any state 
in the Hilbert space into any other state. This will, in particular, allow the demon to 
evolve entangled eigenstates of an arbitrary known psa into product states. The resulting 
density matrix can be measured in a local basis that does not perturb its eigenstates, and, 
hence, as viewed by the bystander, it will not suffer any additional increase of entropy. The 
work extracted by the optimal quantum demon is limited simply by the basis-independent 
joint entropy of the initial psa- 

The difference between the efficiency of the quantum and classical demons can be now 
immediately computed: 

AW/k B2 T = [H(A) + H(S\A)] {]Ak)} - H(S, A) (16) 

or: 

AW = k B2 T5 A (S\A {]Ak)} ) (17) 

Equation (17) relating the extra work AW to discord defined as the difference of the acces- 
sibe joint entropy of classical (local) and quantum (global) demons is the principal result 
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of our paper. It answers an interesting physics question while simultaneously providing an 
operational interpretation of the discord. 

To gain further insight into implications of the above discussion, let us first note that 
discord is in general obviously basis dependent. Discord disappears iff the density matrix 
has the "post-decoherence" (or "post- measurement" ) form, Eq. (6), already before the 
measurement. Given the ability of classical demons to match the quantum performance 
standard in this case, basis that allowes for the disappearance of discord in the 

presence of non-trivial correlation can be justifiably deemed classical. We note that the 
Psa °f the locally diagonal form presented above may emerge as a consequence of the 
coupling of A with the environment 8-11 . The preferred pointer basis emerges as a result 
of einselection. 

A typical psa does not have the form of Eq. (5), however. In that case discord does 
not completely disappear for any basis, and is usually basis-dependent. It is therefore of 
interest to enquire about the basis that yields the least discord, 8(S\A). This leads us back 
to the ambiguity in the definition of the discord: we could adopt either: 

5(S\A)=min {lAk)} [H(A)+H(S\A)] {lAk)} -H(SA) (18) 

or 

d(S\A) = H(A) + min {lAk)} H(S\A) {lAk)} - H(SA) (19) 

The difference between them is obvious, and <5(<S|^4) > d(S\A). 

Discord is not symmteric between the two ends of the correlation: In general, 

S(SA) ^ 5{SA) (20) 

In particular, for density matrices that emerge from S(S\A) may vanish but S(A\S) may 
may remain finite. Such correlations are one-way classically accessible. They are charac- 
terised by a preferred direction - from A to S - in which more information about the joint 
state can be acquired. Thus, a local demon that can choose between the two "ends" of the 
SA pair may be in some cases more efficient than a one-way demon. Indeed, one could 
define a polarization 

w{S\A) = 6{S\A) - 6{A\S) (21) 

to quantify this directionality. 

One can generalise discord to situations involving collections of correlated systems. 
The obvious strategy is to define it as a difference between the joint entropy corresponding 
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to a particular sequence of (possibly conditional) measurements - that is, the obvious 
generalisation of Eq. (7) - and the joint von Neumann entropy of the unmeasured density 
matrix. One could define a minimal discord of a collection of systems as a minimum over 
all possible sequences of measurements. This corresponds to the demon having a choice of 
the end of the pair it can measure first. 

First hints of the quantum underpinnings of the Universe emerged over a century 
ago in a thermodynamic setting involving black body radiation. We have studied here 
implications of quantum physics - and, in particular, of the quantum aspects of correlations 
- for classical and quantum Maxwell's demons. We have seen that discord is a measure 
of the advantage afforded by the quantum conditional dynamics, and shown that this 
advantage is eliminated in presence of decoherence and the ensuing einselection. Our 
discussion sheds a new light on the problem of transition between quantum and classical: 
It leads to an operational measure of the quantum aspect of correlations. As was already 
pointed out 4 , the aspect of quantumness captured by discord is not the entanglement. 
Rather, it is related to the degree to which quantum superpositions are implicated in a 
state of a pair or of a collection of quantum systems. We expect it to be relevant in 
questions involving quantum theory and thermodynamics (see e.g. Ref. 24), but discord 
may be also of use in characterising multiply correlated states that find applications in 
quantum computation. 

References 

1. Maxwell, J. C. Theory of Heat, 4th ed., pp. 328-329 (Longman's, Green, & Co., 
London 1985). 

2. Leff H. S. and Rex, A. F. Maxwell's Demon: Entropy, Information, Computing, 
(Princeton University Press, Princeton, 1990). 

3 Zurek, W. H. Annalen der Phystk9, 855 (2000). 

4 Zurek, W. H. Decoherence, einselection, and the quantum origin of the classical, Rev. 
Mod. Phys. (in the press), |quant-ph 0105127| (2001) 

5 Ollivier, H. and Zurek, W. H. Phys. Rev. Lett. (2002). 

6 Cover, T. M. and Thomas, J. A. Elements of Information Theory (Wiley, New York, 
1991). 

7 Fuchs, C. and Peres, A. Physics Today 53, 70 (2000). 

8 Zurek, W. H., Pointer basis of a quantum apparatus: Into what mixture does the 
wavepacket collapse? Phys. Rev. D 24, 1516-1524 (1981). 



8 



9 Zurek, W. H., Decoherence and the transition from quantum to classical Physics Today 
44, 36-46 (1991). 

10 Giulini, D., Joos, E., Kiefer, C, Kupsch, J., Stamatescu, I.-O., and Zeh, H. D., 
Decoherence and the Appearance of a Classical World in Quantum Theory, (Springer, 
Berlin, 1996). 

11 Paz, J. P., and Zurek, W. H., Environment-induced decoherence and the transition 
from quantum to classical, Les Houches Lectures (2000). 

12 Klein, O. Z. Phys. 72, 767-775 (1931). 

13 Szilard, L. Z. Phys. 53 840 (1929). English translation in Behav. Sci. 9, 301 (1964), 
reprinted in Quantum Theory and Measurement, edited by J. A. Wheeler and W. H. 
Zurek (Princeton University Press, Princeton, 1983); Reprinted in Ref. 2. 

14 Landauer, R. IBM J. Res. Dev. 3, 183 (1961). 

15 Bennett, C. H. Int. J. Theor. Phys. 21, 905 (1982). 

16 Bennett, C. H. Sci. Am. 255 (11), 108 (1987). 

17 Zurek, W. H. Nature 347, 119-124 (1989); Phys. Rev. A40, 4731-4751 (1989). 

18 Zurek, W. H. Maxwell's demon, Szilard's engine, and quantum measurements, pp. 
151-161 in Frontiers of Nonequilibrium Statistical Physics, G. T. Moore and M. O. 
Scully, eds., (Plenum Press, New York, 1986); reprinted in Ref. 2. 

19 Lloyd, S. Quantum-mechanical Maxwell's demon Phys. Rev. A56, 3374-3382 (1997). 

20 Nielsen, M. A., Caves, C. M., Schumacher, B., and Barnum, H. Information theoretic 
approach to quantum error correction and reversible measurement, Proc. Roy. Soc. 
London A454, 277-304 (1998). 

21 Zurek, W. H., Algorithmic randomness, physical entropy, measurements, and the 
demon of choice, pp 393-410 in Feynman and Computation, A. J. G. Hey, ed., (Perseus 
Books, Reading, 1999). 

22 Li, M. and Vitanyi, P. An Introduction to Kolmogorov Complexity and its Applications 
(Springer, Berlin, 1993). 

23 Zurek, W. H. Information transfer in quantum measurements, pp. 87-116 in Quantum 
Optics, Experimental Gravity, and the Measurement Theory, P. Meystre and M. O. 
Scully, eds. (Plenum, New York, 1983). 

24 Horodecki, R., Horodecki, M., and Horodecki, P. Phys. Rev. A63 022310 (2001). 



9 



